Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 2018245 |
| Missing cells | 17263060 |
| Missing cells (%) | 30.5% |
| Total size in memory | 431.1 MiB |
| Average record size in memory | 224.0 B |
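As a quick sanity check, the derived figures in the table above can be recomputed from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (observations times variables) and that 1 MiB is 1,048,576 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 28
n_observations = 2_018_245
missing_cells = 17_263_060
total_size_mib = 431.1

# Share of missing cells over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per row, assuming 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both values agree with the table after rounding to one decimal place.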
Variable types
| DateTime | 1 |
|---|---|
| Text | 15 |
| Unsupported | 1 |
| Numeric | 11 |
BOROUGH has 627854 (31.1%) missing values | Missing |
ZIP CODE has 628092 (31.1%) missing values | Missing |
LATITUDE has 229685 (11.4%) missing values | Missing |
LONGITUDE has 229685 (11.4%) missing values | Missing |
LOCATION has 229685 (11.4%) missing values | Missing |
ON STREET NAME has 424807 (21.0%) missing values | Missing |
CROSS STREET NAME has 755532 (37.4%) missing values | Missing |
OFF STREET NAME has 1685810 (83.5%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 2 has 307909 (15.3%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 3 has 1875114 (92.9%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 4 has 1986122 (98.4%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 5 has 2009575 (99.6%) missing values | Missing |
VEHICLE TYPE CODE 2 has 376990 (18.7%) missing values | Missing |
VEHICLE TYPE CODE 3 has 1880098 (93.2%) missing values | Missing |
VEHICLE TYPE CODE 4 has 1987193 (98.5%) missing values | Missing |
VEHICLE TYPE CODE 5 has 2009835 (99.6%) missing values | Missing |
LATITUDE is highly skewed (γ1 = -20.42797789) | Skewed |
NUMBER OF PERSONS KILLED is highly skewed (γ1 = 34.05808743) | Skewed |
NUMBER OF PEDESTRIANS KILLED is highly skewed (γ1 = 41.90421138) | Skewed |
NUMBER OF CYCLIST KILLED is highly skewed (γ1 = 95.71982564) | Skewed |
NUMBER OF MOTORIST KILLED is highly skewed (γ1 = 54.57753588) | Skewed |
COLLISION_ID has unique values | Unique |
ZIP CODE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
NUMBER OF PERSONS INJURED has 1568357 (77.7%) zeros | Zeros |
NUMBER OF PERSONS KILLED has 2015410 (99.9%) zeros | Zeros |
NUMBER OF PEDESTRIANS INJURED has 1911465 (94.7%) zeros | Zeros |
NUMBER OF PEDESTRIANS KILLED has 2016798 (99.9%) zeros | Zeros |
NUMBER OF CYCLIST INJURED has 1966117 (97.4%) zeros | Zeros |
NUMBER OF CYCLIST KILLED has 2018020 (> 99.9%) zeros | Zeros |
NUMBER OF MOTORIST INJURED has 1730540 (85.7%) zeros | Zeros |
NUMBER OF MOTORIST KILLED has 2017144 (99.9%) zeros | Zeros |
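The γ1 values in the Skewed alerts are skewness coefficients. A minimal sketch of the population form, g1 = m3 / m2^1.5, is shown below on synthetic data. The report's own estimator may apply a bias correction, so this illustrates the sign and rough magnitude rather than reproducing the figures above; `skewness` and both sample lists are made up for the example.

```python
def skewness(values):
    """Population skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Synthetic sample: 998 plausible latitudes plus 2 zero placeholders.
latitudes = [40.6 + 0.0003 * i for i in range(998)] + [0.0, 0.0]

# Synthetic count column that is almost entirely zero.
killed = [0] * 995 + [1] * 4 + [2]

print(skewness(latitudes))  # negative: the zeros form a long left tail
print(skewness(killed))     # positive: rare non-zero counts form a right tail
```

This suggests why LATITUDE, with its zero placeholders, is flagged with a large negative γ1 while the mostly-zero KILLED columns are flagged with large positive values.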
Reproduction
| Analysis started | 2023-10-02 00:42:05.923870 |
|---|---|
| Analysis finished | 2023-10-02 00:42:27.154555 |
| Duration | 21.23 seconds |
| Software version | ydata-profiling v4.5.1 |
| Download configuration | config.json |
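The reported duration follows directly from the two timestamps. A small sketch using only the standard library, with the timestamps copied from the table above:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2023-10-02 00:42:05.923870")
finished = datetime.fromisoformat("2023-10-02 00:42:27.154555")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```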
| Distinct | 1096801 |
|---|---|
| Distinct (%) | 54.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.4 MiB |
| Minimum | 2012-07-01 00:05:00 |
|---|---|
| Maximum | 2023-08-15 23:59:00 |
BOROUGH
Text
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 627854 |
| Missing (%) | 31.1% |
| Memory size | 15.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.456125651 |
| Min length | 5 |
Characters and Unicode
| Total characters | 10366930 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | BRONX |
| 4th row | BROOKLYN |
| 5th row | MANHATTAN |
| Value | Count | Frequency (%) |
| brooklyn | 441026 | 30.4% |
| queens | 372457 | 25.7% |
| manhattan | 313266 | 21.6% |
| bronx | 205345 | 14.2% |
| staten | 58297 | 4.0% |
| island | 58297 | 4.0% |
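The word table above counts whitespace-separated, lowercased tokens, which is why "staten" and "island" appear as separate rows with identical counts. The sketch below rebuilds the table under that assumption. The per-borough counts are inferred from the word counts (treating "STATEN ISLAND" as a single value), and the percentages are taken over all tokens, which is consistent with the 4.0% shown for "staten".

```python
from collections import Counter

# Borough counts implied by the word table ("STATEN ISLAND" is one value).
borough_counts = {
    "BROOKLYN": 441026,
    "QUEENS": 372457,
    "MANHATTAN": 313266,
    "BRONX": 205345,
    "STATEN ISLAND": 58297,
}

# Split each value into lowercase words and accumulate the counts.
words = Counter()
for value, count in borough_counts.items():
    for word in value.lower().split():
        words[word] += count

total = sum(words.values())
for word, count in words.most_common():
    print(f"{word:10s} {count:7d} {100 * count / total:5.1f}%")
```

The five borough counts sum to 1,390,391, which equals the number of observations minus the 627,854 missing values.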
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1761954 | |
| O | 1087397 | |
| A | 1056392 | |
| E | 803211 | 7.7% |
| T | 743126 | 7.2% |
| R | 646371 | 6.2% |
| B | 646371 | 6.2% |
| L | 499323 | 4.8% |
| S | 489051 | 4.7% |
| Y | 441026 | 4.3% |
| Other values (9) | 2192708 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10308633 | |
| Space Separator | 58297 | 0.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1761954 | |
| O | 1087397 | |
| A | 1056392 | |
| E | 803211 | 7.8% |
| T | 743126 | 7.2% |
| R | 646371 | 6.3% |
| B | 646371 | 6.3% |
| L | 499323 | 4.8% |
| S | 489051 | 4.7% |
| Y | 441026 | 4.3% |
| Other values (8) | 2134411 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 58297 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10308633 | |
| Common | 58297 | 0.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1761954 | |
| O | 1087397 | |
| A | 1056392 | |
| E | 803211 | 7.8% |
| T | 743126 | 7.2% |
| R | 646371 | 6.3% |
| B | 646371 | 6.3% |
| L | 499323 | 4.8% |
| S | 489051 | 4.7% |
| Y | 441026 | 4.3% |
| Other values (8) | 2134411 |
Common
| Value | Count | Frequency (%) |
| (space) | 58297 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10366930 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1761954 | |
| O | 1087397 | |
| A | 1056392 | |
| E | 803211 | 7.7% |
| T | 743126 | 7.2% |
| R | 646371 | 6.2% |
| B | 646371 | 6.2% |
| L | 499323 | 4.8% |
| S | 489051 | 4.7% |
| Y | 441026 | 4.3% |
| Other values (9) | 2192708 |
ZIP CODE
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 628092 |
|---|---|
| Missing (%) | 31.1% |
| Memory size | 15.4 MiB |
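One possible way to address the Unsupported flag is to normalize ZIP CODE to 5-digit strings before profiling. The sketch below assumes the column mixes strings, floats, and blanks, which the report does not confirm; `clean_zip` is a hypothetical helper, not part of ydata-profiling.

```python
import math
import re

def clean_zip(value):
    """Return a 5-digit ZIP string, or None if the value cannot be normalized."""
    if value is None:
        return None
    if isinstance(value, float):
        # NaN, infinities, and fractional values are treated as missing.
        if not math.isfinite(value) or not value.is_integer():
            return None
        value = int(value)
    # Accept exactly five digits, optionally followed by ".0" from a float cast.
    match = re.fullmatch(r"(\d{5})(?:\.0+)?", str(value).strip())
    return match.group(1) if match else None

raw = [11208.0, " 10001 ", "10001.0", float("nan"), "", "1000A", None]
print([clean_zip(v) for v in raw])
```

Returning `None` for anything that does not match keeps unparseable entries visible as missing values instead of silently coercing them.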
LATITUDE
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 125750 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 229685 |
| Missing (%) | 11.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.62776338 |
| Minimum | 0 |
|---|---|
| Maximum | 43.344444 |
| Zeros | 4235 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.596733 |
| Q1 | 40.667923 |
| median | 40.721024 |
| Q3 | 40.7695595 |
| 95-th percentile | 40.8620661 |
| Maximum | 43.344444 |
| Range | 43.344444 |
| Interquartile range (IQR) | 0.1016365 |
Descriptive statistics
| Standard deviation | 1.980900782 |
|---|---|
| Coefficient of variation (CV) | 0.04875731808 |
| Kurtosis | 415.9795219 |
| Mean | 40.62776338 |
| Median Absolute Deviation (MAD) | 0.0513 |
| Skewness | -20.42797789 |
| Sum | 72665192.48 |
| Variance | 3.923967909 |
| Monotonicity | Not monotonic |
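Several rows in the LATITUDE tables are derived from others, so they can be cross-checked. The sketch below assumes the statistics are computed over non-missing values only; all inputs are copied from the tables above, and the variable names are illustrative.

```python
# Figures copied from the LATITUDE tables.
n_valid = 2_018_245 - 229_685   # observations minus missing values
mean = 40.62776338
std = 1.980900782
q1, q3 = 40.667923, 40.7695595

variance = std ** 2             # reported: 3.923967909
cv = std / mean                 # reported: 0.04875731808
iqr = q3 - q1                   # reported: 0.1016365
total = mean * n_valid          # reported: 72665192.48

print(variance, cv, iqr, total)
```

The recomputed sum matches only when the 229,685 missing rows are excluded from the count, which supports the assumption above.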
| Value | Count | Frequency (%) |
| 0 | 4235 | 0.2% |
| 40.861862 | 853 | < 0.1% |
| 40.696033 | 742 | < 0.1% |
| 40.8047 | 691 | < 0.1% |
| 40.608757 | 671 | < 0.1% |
| 40.798256 | 627 | < 0.1% |
| 40.759308 | 613 | < 0.1% |
| 40.6960346 | 587 | < 0.1% |
| 40.675735 | 533 | < 0.1% |
| 40.658577 | 502 | < 0.1% |
| Other values (125740) | 1778506 | 88.1% |
| (Missing) | 229685 | 11.4% |
| Value | Count | Frequency (%) |
| 0 | 4235 | 0.2% |
| 30.78418 | 1 | < 0.1% |
| 34.783634 | 1 | < 0.1% |
| 40.4989488 | 2 | < 0.1% |
| 40.4991346 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 43.344444 | 1 | < 0.1% |
| 42.64154 | 1 | < 0.1% |
| 42.318317 | 1 | < 0.1% |
| 42.107204 | 1 | < 0.1% |
| 41.91661 | 1 | < 0.1% |
LONGITUDE
Real number (ℝ)
MISSING 
| Distinct | 97829 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 229685 |
| Missing (%) | 11.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.75228388 |
| Minimum | -201.35999 |
|---|---|
| Maximum | 0 |
| Zeros | 4235 |
| Zeros (%) | 0.2% |
| Negative | 1784325 |
| Negative (%) | 88.4% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | -201.35999 |
|---|---|
| 5-th percentile | -74.0349992 |
| Q1 | -73.9749344 |
| median | -73.9273161 |
| Q3 | -73.86665 |
| 95-th percentile | -73.7631722 |
| Maximum | 0 |
| Range | 201.35999 |
| Interquartile range (IQR) | 0.1082844 |
Descriptive statistics
| Standard deviation | 3.727568036 |
|---|---|
| Coefficient of variation (CV) | -0.05054173023 |
| Kurtosis | 441.0923234 |
| Mean | -73.75228388 |
| Median Absolute Deviation (MAD) | 0.0526739 |
| Skewness | 15.98140474 |
| Sum | -131910384.9 |
| Variance | 13.89476346 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4235 | 0.2% |
| -73.89063 | 738 | < 0.1% |
| -73.91282 | 717 | < 0.1% |
| -73.98453 | 698 | < 0.1% |
| -74.038086 | 672 | < 0.1% |
| -73.91243 | 652 | < 0.1% |
| -73.89686 | 630 | < 0.1% |
| -73.9845292 | 587 | < 0.1% |
| -73.882744 | 560 | < 0.1% |
| -73.94476 | 559 | < 0.1% |
| Other values (97819) | 1778512 | 88.1% |
| (Missing) | 229685 | 11.4% |
| Value | Count | Frequency (%) |
| -201.35999 | 1 | < 0.1% |
| -201.23706 | 105 | < 0.1% |
| -89.13527 | 1 | < 0.1% |
| -86.76847 | 1 | < 0.1% |
| -79.61955 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4235 | 0.2% |
| -32.768513 | 16 | < 0.1% |
| -47.209625 | 3 | < 0.1% |
| -73.66301 | 1 | < 0.1% |
| -73.70055 | 2 | < 0.1% |
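The extreme values in the LATITUDE and LONGITUDE tables (0, 43.344444, -201.35999, and so on) fall outside New York City and are probably placeholders or geocoding errors. A minimal sketch of a plausibility filter follows. The bounding box is a rough assumption chosen for illustration, not a value from the report, and the sample pairs combine values listed above into made-up pairings.

```python
# Approximate NYC bounding box; these limits are an assumption.
LAT_MIN, LAT_MAX = 40.4, 41.0
LON_MIN, LON_MAX = -74.3, -73.6

def plausible(lat, lon):
    """True if both coordinates are present and fall inside the bounding box."""
    if lat is None or lon is None:
        return False
    return LAT_MIN <= lat <= LAT_MAX and LON_MIN <= lon <= LON_MAX

# Illustrative pairs; only the first is a real sample row from the report.
samples = [
    (40.667202, -73.8665),
    (0.0, 0.0),
    (40.7, -201.23706),
    (43.344444, -73.9),
]
kept = [pair for pair in samples if plausible(*pair)]
print(kept)
```

Treating out-of-box coordinates as missing, instead of dropping the rows, would keep the rest of each collision record available for analysis.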
LOCATION
Text
MISSING 
| Distinct | 274041 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 229685 |
| Missing (%) | 11.4% |
| Memory size | 15.4 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 22.81198618 |
| Min length | 10 |
Characters and Unicode
| Total characters | 40800606 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 6 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 149817 |
|---|---|
| Unique (%) | 8.4% |
Sample
| 1st row | (40.667202, -73.8665) |
|---|---|
| 2nd row | (40.683304, -73.917274) |
| 3rd row | (40.709183, -73.956825) |
| 4th row | (40.86816, -73.83148) |
| 5th row | (40.67172, -73.8971) |
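LOCATION stores each coordinate pair as a string in the form shown in the sample rows, which is why it is profiled as Text. A minimal sketch of converting it back to numbers, assuming every non-missing value follows the `(lat, lon)` pattern; `parse_location` is a hypothetical helper.

```python
def parse_location(text):
    """Parse '(lat, lon)' into a (float, float) tuple; return None on failure."""
    if not text:
        return None
    parts = text.strip().strip("()").split(",")
    if len(parts) != 2:
        return None
    try:
        return float(parts[0]), float(parts[1])
    except ValueError:
        return None

print(parse_location("(40.667202, -73.8665)"))
print(parse_location("(0.0, 0.0)"))
print(parse_location(""))
```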
| Value | Count | Frequency (%) |
| 0.0 | 8470 | 0.2% |
| 40.861862 | 853 | < 0.1% |
| 40.696033 | 742 | < 0.1% |
| 73.89063 | 738 | < 0.1% |
| 73.91282 | 717 | < 0.1% |
| 73.98453 | 698 | < 0.1% |
| 40.8047 | 691 | < 0.1% |
| 74.038086 | 672 | < 0.1% |
| 40.608757 | 671 | < 0.1% |
| 73.91243 | 652 | < 0.1% |
| Other values (223568) | 3562216 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 4470331 | |
| 4 | 3869198 | 9.5% |
| . | 3577120 | 8.8% |
| 3 | 3403654 | 8.3% |
| 0 | 3307996 | 8.1% |
| 9 | 2629709 | 6.4% |
| 8 | 2577886 | 6.3% |
| 6 | 2544516 | 6.2% |
| 5 | 2037315 | 5.0% |
| ( | 1788560 | 4.4% |
| Other values (6) | 10594321 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 28284921 | |
| Other Punctuation | 5365680 | 13.2% |
| Open Punctuation | 1788560 | 4.4% |
| Space Separator | 1788560 | 4.4% |
| Close Punctuation | 1788560 | 4.4% |
| Dash Punctuation | 1784325 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 4470331 | |
| 4 | 3869198 | |
| 3 | 3403654 | |
| 0 | 3307996 | |
| 9 | 2629709 | |
| 8 | 2577886 | |
| 6 | 2544516 | |
| 5 | 2037315 | |
| 2 | 1739489 | 6.1% |
| 1 | 1704827 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3577120 | |
| , | 1788560 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1788560 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1788560 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1788560 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1784325 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 40800606 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 4470331 | |
| 4 | 3869198 | 9.5% |
| . | 3577120 | 8.8% |
| 3 | 3403654 | 8.3% |
| 0 | 3307996 | 8.1% |
| 9 | 2629709 | 6.4% |
| 8 | 2577886 | 6.3% |
| 6 | 2544516 | 6.2% |
| 5 | 2037315 | 5.0% |
| ( | 1788560 | 4.4% |
| Other values (6) | 10594321 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40800606 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 4470331 | |
| 4 | 3869198 | 9.5% |
| . | 3577120 | 8.8% |
| 3 | 3403654 | 8.3% |
| 0 | 3307996 | 8.1% |
| 9 | 2629709 | 6.4% |
| 8 | 2577886 | 6.3% |
| 6 | 2544516 | 6.2% |
| 5 | 2037315 | 5.0% |
| ( | 1788560 | 4.4% |
| Other values (6) | 10594321 |
ON STREET NAME
Text
MISSING 
| Distinct | 17990 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 424807 |
| Missing (%) | 21.0% |
| Memory size | 15.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 30.02577948 |
| Min length | 2 |
Characters and Unicode
| Total characters | 47844218 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 10 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 6346 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | WHITESTONE EXPRESSWAY |
|---|---|
| 2nd row | QUEENSBORO BRIDGE UPPER |
| 3rd row | THROGS NECK BRIDGE |
| 4th row | SARATOGA AVENUE |
| 5th row | MAJOR DEEGAN EXPRESSWAY RAMP |
| Value | Count | Frequency (%) |
| avenue | 593450 | 16.1% |
| street | 509147 | 13.9% |
| east | 150248 | 4.1% |
| boulevard | 124118 | 3.4% |
| west | 112399 | 3.1% |
| parkway | 71852 | 2.0% |
| road | 66512 | 1.8% |
| expressway | 60732 | 1.7% |
| island | 29161 | 0.8% |
| queens | 26387 | 0.7% |
| Other values (5367) | 1931980 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 27508136 | |
| E | 3581653 | 7.5% |
| A | 1899395 | 4.0% |
| T | 1789078 | 3.7% |
| R | 1624994 | 3.4% |
| N | 1389933 | 2.9% |
| S | 1371656 | 2.9% |
| U | 953525 | 2.0% |
| O | 846204 | 1.8% |
| V | 830996 | 1.7% |
| Other values (65) | 6048648 | 12.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 27508136 | |
| Uppercase Letter | 19062728 | |
| Decimal Number | 1147165 | 2.4% |
| Lowercase Letter | 115400 | 0.2% |
| Other Punctuation | 4436 | < 0.1% |
| Open Punctuation | 3091 | < 0.1% |
| Close Punctuation | 3087 | < 0.1% |
| Dash Punctuation | 173 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
| Control | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3581653 | |
| A | 1899395 | |
| T | 1789078 | |
| R | 1624994 | 8.5% |
| N | 1389933 | 7.3% |
| S | 1371656 | 7.2% |
| U | 953525 | 5.0% |
| O | 846204 | 4.4% |
| V | 830996 | 4.4% |
| L | 625583 | 3.3% |
| Other values (16) | 4149711 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15435 | |
| r | 10192 | 8.8% |
| n | 9717 | 8.4% |
| a | 9624 | 8.3% |
| t | 8411 | 7.3% |
| s | 7077 | 6.1% |
| o | 6798 | 5.9% |
| y | 5680 | 4.9% |
| l | 5346 | 4.6% |
| d | 4456 | 3.9% |
| Other values (16) | 32664 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 260820 | |
| 3 | 129868 | |
| 2 | 128207 | |
| 4 | 108794 | |
| 5 | 106506 | |
| 6 | 93244 | 8.1% |
| 8 | 86291 | 7.5% |
| 7 | 84717 | 7.4% |
| 9 | 75661 | 6.6% |
| 0 | 73057 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3275 | |
| / | 1024 | 23.1% |
| & | 62 | 1.4% |
| ' | 37 | 0.8% |
| # | 16 | 0.4% |
| , | 16 | 0.4% |
| @ | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27508136 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3091 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3087 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 173 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 1 |
Control
| Value | Count | Frequency (%) |
| (control character) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28666090 | |
| Latin | 19178128 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3581653 | |
| A | 1899395 | |
| T | 1789078 | |
| R | 1624994 | 8.5% |
| N | 1389933 | 7.2% |
| S | 1371656 | 7.2% |
| U | 953525 | 5.0% |
| O | 846204 | 4.4% |
| V | 830996 | 4.3% |
| L | 625583 | 3.3% |
| Other values (42) | 4265111 |
Common
| Value | Count | Frequency (%) |
| (space) | 27508136 | |
| 1 | 260820 | 0.9% |
| 3 | 129868 | 0.5% |
| 2 | 128207 | 0.4% |
| 4 | 108794 | 0.4% |
| 5 | 106506 | 0.4% |
| 6 | 93244 | 0.3% |
| 8 | 86291 | 0.3% |
| 7 | 84717 | 0.3% |
| 9 | 75661 | 0.3% |
| Other values (13) | 83846 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47844218 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 27508136 | |
| E | 3581653 | 7.5% |
| A | 1899395 | 4.0% |
| T | 1789078 | 3.7% |
| R | 1624994 | 3.4% |
| N | 1389933 | 2.9% |
| S | 1371656 | 2.9% |
| U | 953525 | 2.0% |
| O | 846204 | 1.8% |
| V | 830996 | 1.7% |
| Other values (65) | 6048648 | 12.6% |
CROSS STREET NAME
Text
MISSING
| Distinct | 20039 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 755532 |
| Missing (%) | 37.4% |
| Memory size | 15.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 22.92086008 |
| Min length | 1 |
Characters and Unicode
| Total characters | 28942468 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 6127 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 20 AVENUE |
|---|---|
| 2nd row | DECATUR STREET |
| 3rd row | EAST 43 STREET |
| 4th row | EAST GATE PLAZA |
| 5th row | west 80 street -west 81 street |
| Value | Count | Frequency (%) |
| avenue | 552823 | 19.8% |
| street | 449789 | 16.1% |
| east | 109830 | 3.9% |
| west | 70043 | 2.5% |
| boulevard | 66991 | 2.4% |
| road | 54298 | 1.9% |
| place | 33223 | 1.2% |
| parkway | 25933 | 0.9% |
| 3 | 18440 | 0.7% |
| park | 17087 | 0.6% |
| Other values (5466) | 1394699 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 14081661 | |
| E | 2872978 | 9.9% |
| T | 1422548 | 4.9% |
| A | 1387639 | 4.8% |
| R | 1121792 | 3.9% |
| N | 1050486 | 3.6% |
| S | 967745 | 3.3% |
| U | 759721 | 2.6% |
| V | 692944 | 2.4% |
| O | 565260 | 2.0% |
| Other values (66) | 4019694 | 13.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 14081661 | |
| Uppercase Letter | 13751087 | |
| Decimal Number | 1048182 | 3.6% |
| Lowercase Letter | 61192 | 0.2% |
| Other Punctuation | 307 | < 0.1% |
| Dash Punctuation | 27 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
| Control | 2 | < 0.1% |
| Math Symbol | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2872978 | |
| T | 1422548 | |
| A | 1387639 | |
| R | 1121792 | 8.2% |
| N | 1050486 | 7.6% |
| S | 967745 | 7.0% |
| U | 759721 | 5.5% |
| V | 692944 | 5.0% |
| O | 565260 | 4.1% |
| L | 427447 | 3.1% |
| Other values (16) | 2482527 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11430 | |
| t | 6384 | |
| a | 6003 | |
| r | 5041 | 8.2% |
| n | 4340 | 7.1% |
| s | 4008 | 6.5% |
| o | 2923 | 4.8% |
| v | 2850 | 4.7% |
| u | 2502 | 4.1% |
| l | 2194 | 3.6% |
| Other values (16) | 13517 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 232122 | |
| 2 | 123492 | |
| 3 | 115117 | |
| 4 | 94619 | |
| 5 | 94388 | |
| 8 | 83322 | 7.9% |
| 7 | 83178 | 7.9% |
| 6 | 82662 | 7.9% |
| 9 | 71902 | 6.9% |
| 0 | 67380 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 127 | |
| . | 71 | |
| & | 52 | |
| ' | 51 | |
| ? | 3 | 1.0% |
| , | 3 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14081661 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 27 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Control
| Value | Count | Frequency (%) |
| (control character) | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| � | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15130189 | |
| Latin | 13812279 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2872978 | |
| T | 1422548 | |
| A | 1387639 | |
| R | 1121792 | 8.1% |
| N | 1050486 | 7.6% |
| S | 967745 | 7.0% |
| U | 759721 | 5.5% |
| V | 692944 | 5.0% |
| O | 565260 | 4.1% |
| L | 427447 | 3.1% |
| Other values (42) | 2543719 |
Common
| Value | Count | Frequency (%) |
| (space) | 14081661 | |
| 1 | 232122 | 1.5% |
| 2 | 123492 | 0.8% |
| 3 | 115117 | 0.8% |
| 4 | 94619 | 0.6% |
| 5 | 94388 | 0.6% |
| 8 | 83322 | 0.6% |
| 7 | 83178 | 0.5% |
| 6 | 82662 | 0.5% |
| 9 | 71902 | 0.5% |
| Other values (14) | 67726 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28942467 | |
| Specials | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 14081661 | |
| E | 2872978 | 9.9% |
| T | 1422548 | 4.9% |
| A | 1387639 | 4.8% |
| R | 1121792 | 3.9% |
| N | 1050486 | 3.6% |
| S | 967745 | 3.3% |
| U | 759721 | 2.6% |
| V | 692944 | 2.4% |
| O | 565260 | 2.0% |
| Other values (65) | 4019693 | 13.9% |
Specials
| Value | Count | Frequency (%) |
| � | 1 |
OFF STREET NAME
Text
MISSING 
| Distinct | 215352 |
|---|---|
| Distinct (%) | 64.8% |
| Missing | 1685810 |
| Missing (%) | 83.5% |
| Memory size | 15.4 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 36.62444388 |
| Min length | 8 |
Characters and Unicode
| Total characters | 12175247 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 168284 |
|---|---|
| Unique (%) | 50.6% |
Sample
| 1st row | 1211 LORING AVENUE |
|---|---|
| 2nd row | 344 BAYCHESTER AVENUE |
| 3rd row | 2047 PITKIN AVENUE |
| 4th row | 480 DEAN STREET |
| 5th row | 878 FLATBUSH AVENUE |
| Value | Count | Frequency (%) |
| avenue | 131712 | 11.9% |
| street | 119606 | 10.8% |
| east | 31610 | 2.9% |
| west | 22819 | 2.1% |
| boulevard | 21244 | 1.9% |
| road | 15677 | 1.4% |
| lot | 7881 | 0.7% |
| parking | 7267 | 0.7% |
| of | 6915 | 0.6% |
| parkway | 6580 | 0.6% |
| Other values (27326) | 736012 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 6748483 | |
| E | 760214 | 6.2% |
| T | 416075 | 3.4% |
| A | 391170 | 3.2% |
| R | 324575 | 2.7% |
| N | 285899 | 2.3% |
| S | 272712 | 2.2% |
| 1 | 263953 | 2.2% |
| U | 194023 | 1.6% |
| O | 181774 | 1.5% |
| Other values (74) | 2336369 | 19.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 6748483 | |
| Uppercase Letter | 3928966 | |
| Decimal Number | 1381320 | 11.3% |
| Dash Punctuation | 78402 | 0.6% |
| Lowercase Letter | 23864 | 0.2% |
| Other Punctuation | 9578 | 0.1% |
| Open Punctuation | 2311 | < 0.1% |
| Close Punctuation | 2300 | < 0.1% |
| Modifier Symbol | 17 | < 0.1% |
| Connector Punctuation | 3 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 760214 | |
| T | 416075 | |
| A | 391170 | |
| R | 324575 | |
| N | 285899 | 7.3% |
| S | 272712 | 6.9% |
| U | 194023 | 4.9% |
| O | 181774 | 4.6% |
| V | 181309 | 4.6% |
| L | 137102 | 3.5% |
| Other values (16) | 784113 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3971 | |
| t | 2786 | |
| r | 2231 | |
| a | 2094 | 8.8% |
| n | 1570 | 6.6% |
| s | 1551 | 6.5% |
| o | 1262 | 5.3% |
| v | 1026 | 4.3% |
| d | 963 | 4.0% |
| l | 954 | 4.0% |
| Other values (16) | 5456 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6431 | |
| & | 1740 | 18.2% |
| . | 1001 | 10.5% |
| @ | 145 | 1.5% |
| , | 83 | 0.9% |
| : | 59 | 0.6% |
| # | 54 | 0.6% |
| ' | 50 | 0.5% |
| * | 8 | 0.1% |
| ? | 3 | < 0.1% |
| Other values (2) | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 263953 | |
| 2 | 179334 | |
| 0 | 155779 | |
| 3 | 140846 | |
| 5 | 139658 | |
| 4 | 123406 | |
| 6 | 100790 | 7.3% |
| 7 | 98482 | 7.1% |
| 8 | 92988 | 6.7% |
| 9 | 86084 | 6.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2299 | |
| ] | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (control character) | 1 | |
| (control character) | 1 | |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6748483 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 78402 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2311 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 17 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8222417 | |
| Latin | 3952830 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 760214 | |
| T | 416075 | |
| A | 391170 | |
| R | 324575 | |
| N | 285899 | 7.2% |
| S | 272712 | 6.9% |
| U | 194023 | 4.9% |
| O | 181774 | 4.6% |
| V | 181309 | 4.6% |
| L | 137102 | 3.5% |
| Other values (42) | 807977 |
Common
| Value | Count | Frequency (%) |
| (space) | 6748483 | |
| 1 | 263953 | 3.2% |
| 2 | 179334 | 2.2% |
| 0 | 155779 | 1.9% |
| 3 | 140846 | 1.7% |
| 5 | 139658 | 1.7% |
| 4 | 123406 | 1.5% |
| 6 | 100790 | 1.2% |
| 7 | 98482 | 1.2% |
| 8 | 92988 | 1.1% |
| Other values (22) | 178698 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12175247 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 6748483 | |
| E | 760214 | 6.2% |
| T | 416075 | 3.4% |
| A | 391170 | 3.2% |
| R | 324575 | 2.7% |
| N | 285899 | 2.3% |
| S | 272712 | 2.2% |
| 1 | 263953 | 2.2% |
| U | 194023 | 1.6% |
| O | 181774 | 1.5% |
| Other values (74) | 2336369 | 19.2% |
NUMBER OF PERSONS INJURED
Real number (ℝ)
ZEROS 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3024248511 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1568357 |
| Zeros (%) | 77.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6937633069 |
|---|---|
| Coefficient of variation (CV) | 2.29400231 |
| Kurtosis | 52.81943505 |
| Mean | 0.3024248511 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.322086162 |
| Sum | 610362 |
| Variance | 0.481307526 |
| Monotonicity | Not monotonic |
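The reported mean and coefficient of variation can be recovered from the Sum, Missing, and Standard deviation rows. The sketch below assumes missing values are excluded from the denominator of the mean, which is the only reading consistent with the reported figure; the variable names are illustrative.

```python
# Figures copied from the NUMBER OF PERSONS INJURED tables.
n_observations = 2_018_245
n_missing = 18
total_injured = 610_362   # "Sum" row
std = 0.6937633069        # "Standard deviation" row

# Mean over non-missing rows only.
n_valid = n_observations - n_missing
mean = total_injured / n_valid

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

print(f"mean = {mean:.10f}")
print(f"CV   = {cv:.8f}")
```

Dividing by all 2,018,245 observations instead would give a slightly smaller mean that no longer matches the table.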
| Value | Count | Frequency (%) |
| 0 | 1568357 | 77.7% |
| 1 | 349056 | 17.3% |
| 2 | 65772 | 3.3% |
| 3 | 21490 | 1.1% |
| 4 | 8002 | 0.4% |
| 5 | 3107 | 0.2% |
| 6 | 1285 | 0.1% |
| 7 | 552 | < 0.1% |
| 8 | 243 | < 0.1% |
| 9 | 120 | < 0.1% |
| Other values (21) | 243 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1568357 | 77.7% |
| 1 | 349056 | 17.3% |
| 2 | 65772 | 3.3% |
| 3 | 21490 | 1.1% |
| 4 | 8002 | 0.4% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
NUMBER OF PERSONS KILLED
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.001446328288 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 2015410 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.04007201236 |
|---|---|
| Coefficient of variation (CV) | 27.70602821 |
| Kurtosis | 1973.851717 |
| Mean | 0.001446328288 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 34.05808743 |
| Sum | 2919 |
| Variance | 0.001605766174 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2015410 | 99.9% |
| 1 | 2716 | 0.1% |
| 2 | 71 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 3 | < 0.1% |
| 8 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| (Missing) | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2015410 | 99.9% |
| 1 | 2716 | 0.1% |
| 2 | 71 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 4 | 3 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 71 | < 0.1% |
NUMBER OF PEDESTRIANS INJURED
Real number (ℝ)
ZEROS 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05518507416 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 1911465 |
| Zeros (%) | 94.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2412866552 |
|---|---|
| Coefficient of variation (CV) | 4.372317314 |
| Kurtosis | 137.052183 |
| Mean | 0.05518507416 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.801458899 |
| Sum | 111377 |
| Variance | 0.05821925 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1911465 | 94.7% |
| 1 | 102865 | 5.1% |
| 2 | 3466 | 0.2% |
| 3 | 344 | < 0.1% |
| 4 | 59 | < 0.1% |
| 5 | 25 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 3 | < 0.1% |
| 9 | 2 | < 0.1% |
| 27 | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1911465 | 94.7% |
| 1 | 102865 | 5.1% |
| 2 | 3466 | 0.2% |
| 3 | 344 | < 0.1% |
| 4 | 59 | < 0.1% |
| Value | Count | Frequency (%) |
| 27 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
NUMBER OF PEDESTRIANS KILLED
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0007253826964 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 2016798 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.02741555777 |
|---|---|
| Coefficient of variation (CV) | 37.79461229 |
| Kurtosis | 2555.389013 |
| Mean | 0.0007253826964 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 41.90421138 |
| Sum | 1464 |
| Variance | 0.0007516128078 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2016798 | 99.9% |
| 1 | 1434 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2016798 | 99.9% |
| 1 | 1434 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 2 | 12 | < 0.1% |
| 1 | 1434 | 0.1% |
| 0 | 2016798 | 99.9% |
NUMBER OF CYCLIST INJURED
Real number (ℝ)
ZEROS 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02612467763 |
| Minimum | 0 |
|---|---|
| Maximum | 4 |
| Zeros | 1966117 |
| Zeros (%) | 97.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1614266671 |
|---|---|
| Coefficient of variation (CV) | 6.179087429 |
| Kurtosis | 38.34362648 |
| Mean | 0.02612467763 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.17799342 |
| Sum | 52726 |
| Variance | 0.02605856886 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1966117 | 97.4% |
| 1 | 51553 | 2.6% |
| 2 | 553 | < 0.1% |
| 3 | 21 | < 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1966117 | 97.4% |
| 1 | 51553 | 2.6% |
| 2 | 553 | < 0.1% |
| 3 | 21 | < 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4 | 1 | < 0.1% |
| 3 | 21 | < 0.1% |
| 2 | 553 | < 0.1% |
| 1 | 51553 | 2.6% |
| 0 | 1966117 | 97.4% |
NUMBER OF CYCLIST KILLED
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0001119784763 |
| Minimum | 0 |
|---|---|
| Maximum | 2 |
| Zeros | 2018020 |
| Zeros (%) | > 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2 |
| Range | 2 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.01062812086 |
|---|---|
| Coefficient of variation (CV) | 94.91217608 |
| Kurtosis | 9312.90116 |
| Mean | 0.0001119784763 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 95.71982564 |
| Sum | 226 |
| Variance | 0.0001129569531 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2018020 | > 99.9% |
| 1 | 224 | < 0.1% |
| 2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 1 | 224 | < 0.1% |
| 0 | 2018020 | > 99.9% |
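The SKEWED flag on NUMBER OF CYCLIST KILLED follows directly from how rare non-zero values are. The sketch below recomputes the skewness from the value counts; it assumes, without confirmation from the report itself, that the bias-corrected (adjusted Fisher-Pearson) estimator is the one behind the Skewness row. `central_moment` is a hypothetical helper.

```python
import math

# Value counts for NUMBER OF CYCLIST KILLED, copied from the report table.
counts = {0: 2018020, 1: 224, 2: 1}

n = sum(counts.values())
mean = sum(v * c for v, c in counts.items()) / n

def central_moment(k):
    """k-th central moment using the population (1/n) normalisation."""
    return sum(c * (v - mean) ** k for v, c in counts.items()) / n

m2 = central_moment(2)
m3 = central_moment(3)
g1 = m3 / m2 ** 1.5                             # biased sample skewness
# Bias correction; with n this large the factor is almost exactly 1.
skewness = g1 * math.sqrt(n * (n - 1)) / (n - 2)

print(round(skewness, 2))
```

With roughly two million zeros and only 225 non-zero rows, the third moment dominates the variance term, which is why the coefficient is so large.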
NUMBER OF MOTORIST INJURED
Real number (ℝ)
ZEROS 
| Distinct | 30 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2179888963 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1730540 |
| Zeros (%) | 85.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6549699539 |
|---|---|
| Coefficient of variation (CV) | 3.004602368 |
| Kurtosis | 65.55690898 |
| Mean | 0.2179888963 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.193745228 |
| Sum | 439955 |
| Variance | 0.4289856406 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1730540 | 85.7% |
| 1 | 193508 | 9.6% |
| 2 | 60096 | 3.0% |
| 3 | 20851 | 1.0% |
| 4 | 7843 | 0.4% |
| 5 | 3056 | 0.2% |
| 6 | 1240 | 0.1% |
| 7 | 527 | < 0.1% |
| 8 | 234 | < 0.1% |
| 9 | 116 | < 0.1% |
| Other values (20) | 234 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1730540 | 85.7% |
| 1 | 193508 | 9.6% |
| 2 | 60096 | 3.0% |
| 3 | 20851 | 1.0% |
| 4 | 7843 | 0.4% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
NUMBER OF MOTORIST KILLED
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0005896211808 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 2017144 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.02648116975 |
|---|---|
| Coefficient of variation (CV) | 44.91217517 |
| Kurtosis | 4042.602393 |
| Mean | 0.0005896211808 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 54.57753588 |
| Sum | 1190 |
| Variance | 0.0007012523514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2017144 | 99.9% |
| 1 | 1031 | 0.1% |
| 2 | 55 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2017144 | 99.9% |
| 1 | 1031 | 0.1% |
| 2 | 55 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 55 | < 0.1% |
| 1 | 1031 | 0.1% |
CONTRIBUTING FACTOR VEHICLE 1
Text
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6348 |
| Missing (%) | 0.3% |
| Memory size | 15.4 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 43 |
| Mean length | 19.45329905 |
| Min length | 1 |
Characters and Unicode
| Total characters | 39138034 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aggressive Driving/Road Rage |
|---|---|
| 2nd row | Pavement Slippery |
| 3rd row | Following Too Closely |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 692736 | 17.3% |
| driver | 432536 | 10.8% |
| inattention/distraction | 401262 | 10.0% |
| too | 157315 | 3.9% |
| closely | 157315 | 3.9% |
| to | 143244 | 3.6% |
| failure | 125196 | 3.1% |
| yield | 119166 | 3.0% |
| right-of-way | 119166 | 3.0% |
| following | 107467 | 2.7% |
| Other values (96) | 1540479 | 38.6% |
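The word table appears to count lowercased, whitespace-separated tokens: `inattention/distraction` and `right-of-way` stay whole, and `too` and `closely` have identical counts. A minimal sketch of that kind of counting follows, assuming plain lowercase-and-split tokenization (the profiler's actual tokenizer is not shown in the report). `word_counts` is a hypothetical helper, and the input is the five sample rows listed for this column.

```python
from collections import Counter

def word_counts(values):
    """Count lowercased, whitespace-separated tokens, skipping missing values."""
    counter = Counter()
    for value in values:
        if value is None:
            continue
        counter.update(value.lower().split())
    return counter

# The five sample rows shown for this column.
sample = [
    "Aggressive Driving/Road Rage",
    "Pavement Slippery",
    "Following Too Closely",
    "Unspecified",
    "Unspecified",
]

counts = word_counts(sample)
print(counts.most_common(3))
```

Because a single category such as "Following Too Closely" contributes three tokens, the word percentages are relative to the total token count, not to the number of rows.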
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4413995 | 11.3% |
| e | 3990869 | 10.2% |
| n | 3401289 | 8.7% |
| t | 2707853 | 6.9% |
| o | 2302244 | 5.9% |
| r | 2289239 | 5.8% |
| s | 2040012 | 5.2% |
| (space) | 1983985 | 5.1% |
| a | 1925256 | 4.9% |
| c | 1515443 | 3.9% |
| Other values (45) | 12567849 | 32.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31977468 | 81.7% |
| Uppercase Letter | 4423958 | 11.3% |
| Space Separator | 1983985 | 5.1% |
| Other Punctuation | 508165 | 1.3% |
| Dash Punctuation | 240030 | 0.6% |
| Open Punctuation | 2108 | < 0.1% |
| Close Punctuation | 2108 | < 0.1% |
| Decimal Number | 212 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4413995 | 13.8% |
| e | 3990869 | 12.5% |
| n | 3401289 | 10.6% |
| t | 2707853 | 8.5% |
| o | 2302244 | 7.2% |
| r | 2289239 | 7.2% |
| s | 2040012 | 6.4% |
| a | 1925256 | 6.0% |
| c | 1515443 | 4.7% |
| l | 1207962 | 3.8% |
| Other values (15) | 6183306 | 19.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 976406 | 22.1% |
| U | 910860 | 20.6% |
| I | 569111 | 12.9% |
| F | 287918 | 6.5% |
| C | 276081 | 6.2% |
| T | 246158 | 5.6% |
| P | 178719 | 4.0% |
| R | 162979 | 3.7% |
| L | 129923 | 2.9% |
| W | 120252 | 2.7% |
| Other values (12) | 565551 | 12.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 101 | 47.6% |
| 0 | 101 | 47.6% |
| 1 | 10 | 4.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1983985 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 508165 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 240030 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2108 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2108 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36401426 | 93.0% |
| Common | 2736608 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4413995 | 12.1% |
| e | 3990869 | 11.0% |
| n | 3401289 | 9.3% |
| t | 2707853 | 7.4% |
| o | 2302244 | 6.3% |
| r | 2289239 | 6.3% |
| s | 2040012 | 5.6% |
| a | 1925256 | 5.3% |
| c | 1515443 | 4.2% |
| l | 1207962 | 3.3% |
| Other values (37) | 10607264 | 29.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1983985 | 72.5% |
| / | 508165 | 18.6% |
| - | 240030 | 8.8% |
| ( | 2108 | 0.1% |
| ) | 2108 | 0.1% |
| 8 | 101 | < 0.1% |
| 0 | 101 | < 0.1% |
| 1 | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39138034 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4413995 | 11.3% |
| e | 3990869 | 10.2% |
| n | 3401289 | 8.7% |
| t | 2707853 | 6.9% |
| o | 2302244 | 5.9% |
| r | 2289239 | 5.8% |
| s | 2040012 | 5.2% |
| (space) | 1983985 | 5.1% |
| a | 1925256 | 4.9% |
| c | 1515443 | 3.9% |
| Other values (45) | 12567849 | 32.1% |
CONTRIBUTING FACTOR VEHICLE 2
Text
MISSING 
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 307909 |
| Missing (%) | 15.3% |
| Memory size | 15.4 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 13.0438674 |
| Min length | 1 |
Characters and Unicode
| Total characters | 22309396 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
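The overview figures for CONTRIBUTING FACTOR VEHICLE 2 are mutually consistent if the mean length is averaged over non-missing values only. A short check under that assumption, with every number copied from the report (the variable names are illustrative):

```python
# Figures copied from the dataset overview and this column's summary.
n_rows = 2018245              # "Number of observations"
n_missing = 307909            # "Missing"
total_characters = 22309396   # "Total characters"

n_present = n_rows - n_missing
missing_pct = 100 * n_missing / n_rows
# Assumption: the mean is taken over non-missing values only.
mean_length = total_characters / n_present

print(f"{missing_pct:.1f}% missing, mean length {mean_length:.4f}")
```

The same relationship can be used to sanity-check the other text columns, since each reports its own Missing and Total characters figures.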
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 1440015 | 68.7% |
| driver | 98225 | 4.7% |
| inattention/distraction | 91712 | 4.4% |
| other | 32434 | 1.5% |
| vehicular | 31373 | 1.5% |
| too | 26844 | 1.3% |
| closely | 26844 | 1.3% |
| to | 21040 | 1.0% |
| passing | 20921 | 1.0% |
| lane | 19538 | 0.9% |
| Other values (96) | 288274 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3517830 | 15.8% |
| e | 3423220 | 15.3% |
| n | 1999138 | 9.0% |
| s | 1714454 | 7.7% |
| c | 1625212 | 7.3% |
| d | 1511422 | 6.8% |
| p | 1507909 | 6.8% |
| f | 1494364 | 6.7% |
| U | 1475432 | 6.6% |
| t | 603217 | 2.7% |
| Other values (45) | 3437198 | 15.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19575504 | 87.7% |
| Uppercase Letter | 2196560 | 9.8% |
| Space Separator | 386884 | 1.7% |
| Other Punctuation | 115746 | 0.5% |
| Dash Punctuation | 34091 | 0.2% |
| Open Punctuation | 281 | < 0.1% |
| Close Punctuation | 281 | < 0.1% |
| Decimal Number | 49 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3517830 | 18.0% |
| e | 3423220 | 17.5% |
| n | 1999138 | 10.2% |
| s | 1714454 | 8.8% |
| c | 1625212 | 8.3% |
| d | 1511422 | 7.7% |
| p | 1507909 | 7.7% |
| f | 1494364 | 7.6% |
| t | 603217 | 3.1% |
| r | 526160 | 2.7% |
| Other values (15) | 1652578 | 8.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1475432 | 67.2% |
| D | 218605 | 10.0% |
| I | 123089 | 5.6% |
| C | 51127 | 2.3% |
| F | 47323 | 2.2% |
| T | 43230 | 2.0% |
| O | 43189 | 2.0% |
| V | 40393 | 1.8% |
| P | 36364 | 1.7% |
| L | 27874 | 1.3% |
| Other values (12) | 89934 | 4.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 22 | 44.9% |
| 0 | 22 | 44.9% |
| 1 | 5 | 10.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 386884 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 115746 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34091 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 281 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 281 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21772064 | 97.6% |
| Common | 537332 | 2.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3517830 | 16.2% |
| e | 3423220 | 15.7% |
| n | 1999138 | 9.2% |
| s | 1714454 | 7.9% |
| c | 1625212 | 7.5% |
| d | 1511422 | 6.9% |
| p | 1507909 | 6.9% |
| f | 1494364 | 6.9% |
| U | 1475432 | 6.8% |
| t | 603217 | 2.8% |
| Other values (37) | 2899866 | 13.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 386884 | 72.0% |
| / | 115746 | 21.5% |
| - | 34091 | 6.3% |
| ( | 281 | 0.1% |
| ) | 281 | 0.1% |
| 8 | 22 | < 0.1% |
| 0 | 22 | < 0.1% |
| 1 | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22309396 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3517830 | 15.8% |
| e | 3423220 | 15.3% |
| n | 1999138 | 9.0% |
| s | 1714454 | 7.7% |
| c | 1625212 | 7.3% |
| d | 1511422 | 6.8% |
| p | 1507909 | 6.8% |
| f | 1494364 | 6.7% |
| U | 1475432 | 6.6% |
| t | 603217 | 2.7% |
| Other values (45) | 3437198 | 15.4% |
CONTRIBUTING FACTOR VEHICLE 3
Text
MISSING 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1875114 |
| Missing (%) | 92.9% |
| Memory size | 15.4 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 11.65463107 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1668139 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 133444 | 85.9% |
| other | 2706 | 1.7% |
| vehicular | 2666 | 1.7% |
| driver | 2060 | 1.3% |
| too | 1909 | 1.2% |
| closely | 1909 | 1.2% |
| inattention/distraction | 1885 | 1.2% |
| following | 1859 | 1.2% |
| fatigued/drowsy | 853 | 0.5% |
| pavement | 394 | 0.3% |
| Other values (79) | 5691 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 285087 | 17.1% |
| i | 283874 | 17.0% |
| n | 146282 | 8.8% |
| s | 140137 | 8.4% |
| c | 139607 | 8.4% |
| d | 135503 | 8.1% |
| p | 135033 | 8.1% |
| f | 134321 | 8.1% |
| U | 134069 | 8.0% |
| o | 16543 | 1.0% |
| Other values (45) | 117683 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1494483 | 89.6% |
| Uppercase Letter | 158061 | 9.5% |
| Space Separator | 12245 | 0.7% |
| Other Punctuation | 3016 | 0.2% |
| Dash Punctuation | 303 | < 0.1% |
| Open Punctuation | 12 | < 0.1% |
| Close Punctuation | 12 | < 0.1% |
| Decimal Number | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 285087 | 19.1% |
| i | 283874 | 19.0% |
| n | 146282 | 9.8% |
| s | 140137 | 9.4% |
| c | 139607 | 9.3% |
| d | 135503 | 9.1% |
| p | 135033 | 9.0% |
| f | 134321 | 9.0% |
| o | 16543 | 1.1% |
| t | 15513 | 1.0% |
| Other values (15) | 62583 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 134069 | 84.8% |
| D | 5387 | 3.4% |
| O | 3026 | 1.9% |
| F | 2952 | 1.9% |
| V | 2940 | 1.9% |
| I | 2387 | 1.5% |
| C | 2381 | 1.5% |
| T | 2163 | 1.4% |
| P | 670 | 0.4% |
| S | 532 | 0.3% |
| Other values (12) | 1554 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3 | 42.9% |
| 0 | 3 | 42.9% |
| 1 | 1 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 12245 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3016 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 303 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1652544 | 99.1% |
| Common | 15595 | 0.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 285087 | 17.3% |
| i | 283874 | 17.2% |
| n | 146282 | 8.9% |
| s | 140137 | 8.5% |
| c | 139607 | 8.4% |
| d | 135503 | 8.2% |
| p | 135033 | 8.2% |
| f | 134321 | 8.1% |
| U | 134069 | 8.1% |
| o | 16543 | 1.0% |
| Other values (37) | 102088 | 6.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 12245 | 78.5% |
| / | 3016 | 19.3% |
| - | 303 | 1.9% |
| ( | 12 | 0.1% |
| ) | 12 | 0.1% |
| 8 | 3 | < 0.1% |
| 0 | 3 | < 0.1% |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1668139 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 285087 | 17.1% |
| i | 283874 | 17.0% |
| n | 146282 | 8.8% |
| s | 140137 | 8.4% |
| c | 139607 | 8.4% |
| d | 135503 | 8.1% |
| p | 135033 | 8.1% |
| f | 134321 | 8.1% |
| U | 134069 | 8.0% |
| o | 16543 | 1.0% |
| Other values (45) | 117683 | 7.1% |
CONTRIBUTING FACTOR VEHICLE 4
Text
MISSING 
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1986122 |
| Missing (%) | 98.4% |
| Memory size | 15.4 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.48706534 |
| Min length | 5 |
Characters and Unicode
| Total characters | 368999 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 30317 | 88.3% |
| other | 584 | 1.7% |
| vehicular | 575 | 1.7% |
| too | 374 | 1.1% |
| closely | 374 | 1.1% |
| following | 369 | 1.1% |
| driver | 293 | 0.9% |
| inattention/distraction | 266 | 0.8% |
| fatigued/drowsy | 170 | 0.5% |
| pavement | 113 | 0.3% |
| Other values (64) | 911 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 64025 | 17.4% |
| i | 63468 | 17.2% |
| n | 32294 | 8.8% |
| c | 31426 | 8.5% |
| s | 31411 | 8.5% |
| d | 30663 | 8.3% |
| p | 30657 | 8.3% |
| f | 30435 | 8.2% |
| U | 30412 | 8.2% |
| o | 2938 | 0.8% |
| Other values (41) | 21270 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 331505 | 89.8% |
| Uppercase Letter | 34758 | 9.4% |
| Space Separator | 2223 | 0.6% |
| Other Punctuation | 471 | 0.1% |
| Dash Punctuation | 34 | < 0.1% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 64025 | 19.3% |
| i | 63468 | 19.1% |
| n | 32294 | 9.7% |
| c | 31426 | 9.5% |
| s | 31411 | 9.5% |
| d | 30663 | 9.2% |
| p | 30657 | 9.2% |
| f | 30435 | 9.2% |
| o | 2938 | 0.9% |
| r | 2638 | 0.8% |
| Other values (15) | 11550 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 30412 | 87.5% |
| D | 837 | 2.4% |
| O | 637 | 1.8% |
| V | 621 | 1.8% |
| F | 583 | 1.7% |
| C | 434 | 1.2% |
| T | 403 | 1.2% |
| I | 336 | 1.0% |
| S | 139 | 0.4% |
| P | 136 | 0.4% |
| Other values (11) | 220 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2223 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 471 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 366263 | 99.3% |
| Common | 2736 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 64025 | 17.5% |
| i | 63468 | 17.3% |
| n | 32294 | 8.8% |
| c | 31426 | 8.6% |
| s | 31411 | 8.6% |
| d | 30663 | 8.4% |
| p | 30657 | 8.4% |
| f | 30435 | 8.3% |
| U | 30412 | 8.3% |
| o | 2938 | 0.8% |
| Other values (36) | 18534 | 5.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 2223 | 81.2% |
| / | 471 | 17.2% |
| - | 34 | 1.2% |
| ( | 4 | 0.1% |
| ) | 4 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 368999 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 64025 | 17.4% |
| i | 63468 | 17.2% |
| n | 32294 | 8.8% |
| c | 31426 | 8.5% |
| s | 31411 | 8.5% |
| d | 30663 | 8.3% |
| p | 30657 | 8.3% |
| f | 30435 | 8.2% |
| U | 30412 | 8.2% |
| o | 2938 | 0.8% |
| Other values (41) | 21270 | 5.8% |
CONTRIBUTING FACTOR VEHICLE 5
Text
MISSING 
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2009575 |
| Missing (%) | 99.6% |
| Memory size | 15.4 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.46758939 |
| Min length | 5 |
Characters and Unicode
| Total characters | 99424 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 8177 | 88.3% |
| other | 168 | 1.8% |
| vehicular | 166 | 1.8% |
| too | 91 | 1.0% |
| closely | 91 | 1.0% |
| following | 89 | 1.0% |
| driver | 73 | 0.8% |
| inattention/distraction | 63 | 0.7% |
| pavement | 48 | 0.5% |
| slippery | 47 | 0.5% |
| Other values (47) | 247 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 17320 | 17.4% |
| i | 17093 | 17.2% |
| n | 8685 | 8.7% |
| c | 8481 | 8.5% |
| s | 8434 | 8.5% |
| p | 8299 | 8.3% |
| d | 8262 | 8.3% |
| f | 8204 | 8.3% |
| U | 8200 | 8.2% |
| o | 729 | 0.7% |
| Other values (40) | 5717 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 89345 | 89.9% |
| Uppercase Letter | 9359 | 9.4% |
| Space Separator | 590 | 0.6% |
| Other Punctuation | 115 | 0.1% |
| Dash Punctuation | 11 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17320 | 19.4% |
| i | 17093 | 19.1% |
| n | 8685 | 9.7% |
| c | 8481 | 9.5% |
| s | 8434 | 9.4% |
| p | 8299 | 9.3% |
| d | 8262 | 9.2% |
| f | 8204 | 9.2% |
| o | 729 | 0.8% |
| r | 715 | 0.8% |
| Other values (15) | 3123 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 8200 | 87.6% |
| D | 204 | 2.2% |
| O | 184 | 2.0% |
| V | 179 | 1.9% |
| F | 142 | 1.5% |
| C | 103 | 1.1% |
| T | 97 | 1.0% |
| I | 87 | 0.9% |
| S | 57 | 0.6% |
| P | 51 | 0.5% |
| Other values (10) | 55 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 590 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 115 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 98704 | 99.3% |
| Common | 720 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 17320 | 17.5% |
| i | 17093 | 17.3% |
| n | 8685 | 8.8% |
| c | 8481 | 8.6% |
| s | 8434 | 8.5% |
| p | 8299 | 8.4% |
| d | 8262 | 8.4% |
| f | 8204 | 8.3% |
| U | 8200 | 8.3% |
| o | 729 | 0.7% |
| Other values (35) | 4997 | 5.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 590 | 81.9% |
| / | 115 | 16.0% |
| - | 11 | 1.5% |
| ( | 2 | 0.3% |
| ) | 2 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99424 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 17320 | 17.4% |
| i | 17093 | 17.2% |
| n | 8685 | 8.7% |
| c | 8481 | 8.5% |
| s | 8434 | 8.5% |
| p | 8299 | 8.3% |
| d | 8262 | 8.3% |
| f | 8204 | 8.3% |
| U | 8200 | 8.2% |
| o | 729 | 0.7% |
| Other values (40) | 5717 | 5.8% |
COLLISION_ID
Real number (ℝ)
UNIQUE 
| Distinct | 2018245 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3116454.661 |
| Minimum | 22 |
|---|---|
| Maximum | 4655026 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.4 MiB |
Quantile statistics
| Minimum | 22 |
|---|---|
| 5-th percentile | 101751.2 |
| Q1 | 3140681 |
| median | 3645346 |
| Q3 | 4150156 |
| 95-th percentile | 4553890.8 |
| Maximum | 4655026 |
| Range | 4655004 |
| Interquartile range (IQR) | 1009475 |
Descriptive statistics
| Standard deviation | 1503996.846 |
|---|---|
| Coefficient of variation (CV) | 0.4825986609 |
| Kurtosis | -0.1124875456 |
| Mean | 3116454.661 |
| Median Absolute Deviation (MAD) | 504738 |
| Skewness | -1.204533211 |
| Sum | 6.289769037 × 10¹² |
| Variance | 2.262006513 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4455765 | 1 | < 0.1% |
| 3215029 | 1 | < 0.1% |
| 3210593 | 1 | < 0.1% |
| 3210501 | 1 | < 0.1% |
| 3218613 | 1 | < 0.1% |
| 3212622 | 1 | < 0.1% |
| 3224369 | 1 | < 0.1% |
| 3213701 | 1 | < 0.1% |
| 3228991 | 1 | < 0.1% |
| 3224246 | 1 | < 0.1% |
| Other values (2018235) | 2018235 | > 99.9% |
| Value | Count | Frequency (%) |
| 22 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4655026 | 1 | < 0.1% |
| 4655023 | 1 | < 0.1% |
| 4655021 | 1 | < 0.1% |
| 4655019 | 1 | < 0.1% |
| 4655016 | 1 | < 0.1% |
VEHICLE TYPE CODE 1
Text
| Distinct | 1562 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 12677 |
| Missing (%) | 0.6% |
| Memory size | 15.4 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 35 |
| Mean length | 16.90862938 |
| Min length | 1 |
Characters and Unicode
| Total characters | 33911406 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 11 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 944 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Sedan |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Dump |
| Value | Count | Frequency (%) |
| vehicle | 860450 | 18.1% |
| utility | 613996 | 12.9% |
| station | 613956 | 12.9% |
| sedan | 593505 | 12.5% |
| wagon/sport | 433665 | 9.1% |
| passenger | 416217 | 8.7% |
|  | 181583 | 3.8% |
| wagon | 180349 | 3.8% |
| sport | 180291 | 3.8% |
| truck | 83089 | 1.7% |
| Other values (918) | 604471 | 12.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2769224 | 8.2% |
| S | 2669549 | 7.9% |
| t | 2199918 | 6.5% |
| i | 1853656 | 5.5% |
| E | 1817948 | 5.4% |
| a | 1551343 | 4.6% |
| e | 1540361 | 4.5% |
| n | 1481788 | 4.4% |
| o | 1372162 | 4.0% |
| T | 1136684 | 3.4% |
| Other values (65) | 15518773 | 45.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15401306 | 45.4% |
| Lowercase Letter | 14949349 | 44.1% |
| Space Separator | 2769224 | 8.2% |
| Other Punctuation | 615298 | 1.8% |
| Decimal Number | 70965 | 0.2% |
| Dash Punctuation | 50031 | 0.1% |
| Open Punctuation | 27616 | 0.1% |
| Close Punctuation | 27613 | 0.1% |
| Modifier Symbol | 2 | < 0.1% |
| Other Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2669549 | 17.3% |
| E | 1817948 | 11.8% |
| T | 1136684 | 7.4% |
| I | 1051994 | 6.8% |
| V | 933305 | 6.1% |
| A | 874896 | 5.7% |
| N | 865293 | 5.6% |
| R | 723411 | 4.7% |
| U | 675992 | 4.4% |
| L | 667524 | 4.3% |
| Other values (16) | 3984710 | 25.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2199918 | 14.7% |
| i | 1853656 | 12.4% |
| a | 1551343 | 10.4% |
| e | 1540361 | 10.3% |
| n | 1481788 | 9.9% |
| o | 1372162 | 9.2% |
| l | 902170 | 6.0% |
| d | 641319 | 4.3% |
| r | 600544 | 4.0% |
| c | 574275 | 3.8% |
| Other values (15) | 2231813 | 14.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 53397 | 75.2% |
| 6 | 14403 | 20.3% |
| 2 | 2675 | 3.8% |
| 3 | 321 | 0.5% |
| 1 | 55 | 0.1% |
| 5 | 42 | 0.1% |
| 0 | 36 | 0.1% |
| 9 | 20 | < 0.1% |
| 8 | 9 | < 0.1% |
| 7 | 7 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 615272 | > 99.9% |
| . | 13 | < 0.1% |
| # | 6 | < 0.1% |
| , | 3 | < 0.1% |
| ' | 2 | < 0.1% |
| ? | 1 | < 0.1% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2769224 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 50031 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 27616 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 27613 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| � | 1 | 100.0% |
Control
| Value | Count | Frequency (%) |
|  | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30350655 | 89.5% |
| Common | 3560751 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2669549 | 8.8% |
| t | 2199918 | 7.2% |
| i | 1853656 | 6.1% |
| E | 1817948 | 6.0% |
| a | 1551343 | 5.1% |
| e | 1540361 | 5.1% |
| n | 1481788 | 4.9% |
| o | 1372162 | 4.5% |
| T | 1136684 | 3.7% |
| I | 1051994 | 3.5% |
| Other values (41) | 13675252 | 45.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 2769224 | 77.8% |
| / | 615272 | 17.3% |
| 4 | 53397 | 1.5% |
| - | 50031 | 1.4% |
| ( | 27616 | 0.8% |
| ) | 27613 | 0.8% |
| 6 | 14403 | 0.4% |
| 2 | 2675 | 0.1% |
| 3 | 321 | < 0.1% |
| 1 | 55 | < 0.1% |
| Other values (14) | 144 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33911405 | > 99.9% |
| Specials | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2769224 | 8.2% |
| S | 2669549 | 7.9% |
| t | 2199918 | 6.5% |
| i | 1853656 | 5.5% |
| E | 1817948 | 5.4% |
| a | 1551343 | 4.6% |
| e | 1540361 | 4.5% |
| n | 1481788 | 4.4% |
| o | 1372162 | 4.0% |
| T | 1136684 | 3.4% |
| Other values (64) | 15518772 | 45.8% |
Specials
| Value | Count | Frequency (%) |
| � | 1 | 100.0% |
VEHICLE TYPE CODE 2
Text
MISSING
| Distinct | 1739 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 376990 |
| Missing (%) | 18.7% |
| Memory size | 15.4 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 30 |
| Mean length | 16.11302509 |
| Min length | 1 |
Characters and Unicode
| Total characters | 26445583 |
|---|---|
| Distinct characters | 72 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1034 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Pick-up Truck |
| 3rd row | Sedan |
| 4th row | Tractor Truck Diesel |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 642409 | 17.1% |
| utility | 455446 | 12.1% |
| station | 455422 | 12.1% |
| sedan | 420741 | 11.2% |
| passenger | 318610 | 8.5% |
| wagon/sport | 315218 | 8.4% |
|  | 141437 | 3.8% |
| wagon | 140256 | 3.7% |
| sport | 140204 | 3.7% |
| truck | 82388 | 2.2% |
| Other values (971) | 643153 | 17.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2126995 | 8.0% |
| S | 1993078 | 7.5% |
| t | 1607094 | 6.1% |
| E | 1437058 | 5.4% |
| i | 1380800 | 5.2% |
| e | 1145297 | 4.3% |
| a | 1125945 | 4.3% |
| n | 1069195 | 4.0% |
| o | 1020212 | 3.9% |
| T | 915148 | 3.5% |
| Other values (62) | 12624761 | 47.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12576139 | 47.6% |
| Lowercase Letter | 11123251 | 42.1% |
| Space Separator | 2126995 | 8.0% |
| Other Punctuation | 456724 | 1.7% |
| Decimal Number | 59135 | 0.2% |
| Dash Punctuation | 50036 | 0.2% |
| Open Punctuation | 26652 | 0.1% |
| Close Punctuation | 26649 | 0.1% |
| Modifier Symbol | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1993078 | 15.8% |
| E | 1437058 | 11.4% |
| T | 915148 | 7.3% |
| N | 869229 | 6.9% |
| I | 842049 | 6.7% |
| V | 709350 | 5.6% |
| A | 685071 | 5.4% |
| O | 587828 | 4.7% |
| R | 577759 | 4.6% |
| U | 573837 | 4.6% |
| Other values (16) | 3385732 | 26.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1607094 | 14.4% |
| i | 1380800 | 12.4% |
| e | 1145297 | 10.3% |
| a | 1125945 | 10.1% |
| n | 1069195 | 9.6% |
| o | 1020212 | 9.2% |
| l | 661121 | 5.9% |
| r | 470682 | 4.2% |
| d | 458514 | 4.1% |
| c | 449963 | 4.0% |
| Other values (15) | 1734428 | 15.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 43057 | 72.8% |
| 6 | 13694 | 23.2% |
| 2 | 1958 | 3.3% |
| 3 | 285 | 0.5% |
| 0 | 53 | 0.1% |
| 1 | 41 | 0.1% |
| 5 | 27 | < 0.1% |
| 9 | 8 | < 0.1% |
| 8 | 7 | < 0.1% |
| 7 | 5 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 456704 | > 99.9% |
| . | 11 | < 0.1% |
| ' | 3 | < 0.1% |
| , | 2 | < 0.1% |
| # | 2 | < 0.1% |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2126995 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 50036 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 26652 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 26649 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23699390 | 89.6% |
| Common | 2746193 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1993078 | 8.4% |
| t | 1607094 | 6.8% |
| E | 1437058 | 6.1% |
| i | 1380800 | 5.8% |
| e | 1145297 | 4.8% |
| a | 1125945 | 4.8% |
| n | 1069195 | 4.5% |
| o | 1020212 | 4.3% |
| T | 915148 | 3.9% |
| N | 869229 | 3.7% |
| Other values (41) | 11136334 | 47.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 2126995 | 77.5% |
| / | 456704 | 16.6% |
| - | 50036 | 1.8% |
| 4 | 43057 | 1.6% |
| ( | 26652 | 1.0% |
| ) | 26649 | 1.0% |
| 6 | 13694 | 0.5% |
| 2 | 1958 | 0.1% |
| 3 | 285 | < 0.1% |
| 0 | 53 | < 0.1% |
| Other values (11) | 110 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26445583 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2126995 | 8.0% |
| S | 1993078 | 7.5% |
| t | 1607094 | 6.1% |
| E | 1437058 | 5.4% |
| i | 1380800 | 5.2% |
| e | 1145297 | 4.3% |
| a | 1125945 | 4.3% |
| n | 1069195 | 4.0% |
| o | 1020212 | 3.9% |
| T | 915148 | 3.5% |
| Other values (62) | 12624761 | 47.7% |
VEHICLE TYPE CODE 3
Text
MISSING
| Distinct | 246 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1880098 |
| Missing (%) | 93.2% |
| Memory size | 15.4 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 17.68346037 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2442917 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 142 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 62326 | 18.5% |
| utility | 47537 | 14.1% |
| station | 47535 | 14.1% |
| sedan | 44904 | 13.4% |
| wagon/sport | 34176 | 10.2% |
| passenger | 27716 | 8.2% |
|  | 13436 | 4.0% |
| sport | 13358 | 4.0% |
| wagon | 13358 | 4.0% |
| truck | 4094 | 1.2% |
| Other values (201) | 27838 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 198566 | 8.1% |
| S | 194466 | 8.0% |
| t | 172204 | 7.0% |
| i | 142271 | 5.8% |
| a | 116663 | 4.8% |
| E | 116377 | 4.8% |
| e | 116165 | 4.8% |
| n | 114066 | 4.7% |
| o | 105313 | 4.3% |
| T | 76669 | 3.1% |
| Other values (52) | 1090157 | 44.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1127807 | 46.2% |
| Uppercase Letter | 1060635 | 43.4% |
| Space Separator | 198566 | 8.1% |
| Other Punctuation | 47613 | 1.9% |
| Decimal Number | 3640 | 0.1% |
| Dash Punctuation | 2904 | 0.1% |
| Open Punctuation | 876 | < 0.1% |
| Close Punctuation | 876 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 194466 | 18.3% |
| E | 116377 | 11.0% |
| T | 76669 | 7.2% |
| I | 71395 | 6.7% |
| N | 65711 | 6.2% |
| V | 65393 | 6.2% |
| A | 57912 | 5.5% |
| U | 52569 | 5.0% |
| W | 50920 | 4.8% |
| O | 46577 | 4.4% |
| Other values (15) | 262646 | 24.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 172204 | 15.3% |
| i | 142271 | 12.6% |
| a | 116663 | 10.3% |
| e | 116165 | 10.3% |
| n | 114066 | 10.1% |
| o | 105313 | 9.3% |
| l | 69668 | 6.2% |
| d | 47840 | 4.2% |
| r | 42374 | 3.8% |
| c | 41169 | 3.7% |
| Other values (14) | 160074 | 14.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2998 | 82.4% |
| 6 | 442 | 12.1% |
| 2 | 185 | 5.1% |
| 3 | 10 | 0.3% |
| 8 | 2 | 0.1% |
| 1 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 198566 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 47613 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2904 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 876 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 876 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2188442 | 89.6% |
| Common | 254475 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 194466 | 8.9% |
| t | 172204 | 7.9% |
| i | 142271 | 6.5% |
| a | 116663 | 5.3% |
| E | 116377 | 5.3% |
| e | 116165 | 5.3% |
| n | 114066 | 5.2% |
| o | 105313 | 4.8% |
| T | 76669 | 3.5% |
| I | 71395 | 3.3% |
| Other values (39) | 962853 | 44.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 198566 | 78.0% |
| / | 47613 | 18.7% |
| 4 | 2998 | 1.2% |
| - | 2904 | 1.1% |
| ( | 876 | 0.3% |
| ) | 876 | 0.3% |
| 6 | 442 | 0.2% |
| 2 | 185 | 0.1% |
| 3 | 10 | < 0.1% |
| 8 | 2 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2442917 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 198566 | 8.1% |
| S | 194466 | 8.0% |
| t | 172204 | 7.0% |
| i | 142271 | 5.8% |
| a | 116663 | 4.8% |
| E | 116377 | 4.8% |
| e | 116165 | 4.8% |
| n | 114066 | 4.7% |
| o | 105313 | 4.3% |
| T | 76669 | 3.1% |
| Other values (52) | 1090157 | 44.6% |
VEHICLE TYPE CODE 4
MISSING
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1987193 |
| Missing (%) | 98.5% |
| Memory size | 15.4 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 17.95169393 |
| Min length | 2 |
Characters and Unicode
| Total characters | 557436 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 43 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Sedan |
| 3rd row | Station Wagon/Sport Utility Vehicle |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 14336 | 18.9% |
| utility | 11162 | 14.7% |
| station | 11162 | 14.7% |
| sedan | 10798 | 14.2% |
| wagon/sport | 8310 | 10.9% |
| passenger | 5969 | 7.9% |
| (blank) | 2859 | 3.8% |
| sport | 2852 | 3.8% |
| wagon | 2852 | 3.8% |
| truck | 744 | 1.0% |
| Other values (101) | 4929 | 6.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 44977 | 8.1% |
| S | 44695 | 8.0% |
| t | 41751 | 7.5% |
| i | 34281 | 6.1% |
| a | 28042 | 5.0% |
| e | 27840 | 5.0% |
| n | 27549 | 4.9% |
| o | 25366 | 4.6% |
| E | 24666 | 4.4% |
| l | 16836 | 3.0% |
| Other values (47) | 241433 | 43.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 270351 | 48.5% |
| Uppercase Letter | 229384 | 41.1% |
| Space Separator | 44977 | 8.1% |
| Other Punctuation | 11169 | 2.0% |
| Decimal Number | 726 | 0.1% |
| Dash Punctuation | 601 | 0.1% |
| Open Punctuation | 114 | < 0.1% |
| Close Punctuation | 114 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 44695 | 19.5% |
| E | 24666 | 10.8% |
| T | 15986 | 7.0% |
| I | 15047 | 6.6% |
| V | 14816 | 6.5% |
| N | 13718 | 6.0% |
| A | 12213 | 5.3% |
| U | 12053 | 5.3% |
| W | 11770 | 5.1% |
| O | 9650 | 4.2% |
| Other values (14) | 54770 | 23.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 41751 | 15.4% |
| i | 34281 | 12.7% |
| a | 28042 | 10.4% |
| e | 27840 | 10.3% |
| n | 27549 | 10.2% |
| o | 25366 | 9.4% |
| l | 16836 | 6.2% |
| d | 11436 | 4.2% |
| r | 9863 | 3.6% |
| c | 9621 | 3.6% |
| Other values (13) | 37766 | 14.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 623 | 85.8% |
| 6 | 58 | 8.0% |
| 2 | 42 | 5.8% |
| 3 | 2 | 0.3% |
| 5 | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 44977 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 11169 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 601 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 114 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 114 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 499735 | 89.6% |
| Common | 57701 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 44695 | 8.9% |
| t | 41751 | 8.4% |
| i | 34281 | 6.9% |
| a | 28042 | 5.6% |
| e | 27840 | 5.6% |
| n | 27549 | 5.5% |
| o | 25366 | 5.1% |
| E | 24666 | 4.9% |
| l | 16836 | 3.4% |
| T | 15986 | 3.2% |
| Other values (37) | 212723 | 42.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 44977 | 77.9% |
| / | 11169 | 19.4% |
| 4 | 623 | 1.1% |
| - | 601 | 1.0% |
| ( | 114 | 0.2% |
| ) | 114 | 0.2% |
| 6 | 58 | 0.1% |
| 2 | 42 | 0.1% |
| 3 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 557436 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 44977 | 8.1% |
| S | 44695 | 8.0% |
| t | 41751 | 7.5% |
| i | 34281 | 6.1% |
| a | 28042 | 5.0% |
| e | 27840 | 5.0% |
| n | 27549 | 4.9% |
| o | 25366 | 4.6% |
| E | 24666 | 4.4% |
| l | 16836 | 3.0% |
| Other values (47) | 241433 | 43.3% |
VEHICLE TYPE CODE 5
MISSING
| Distinct | 67 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 2009835 |
| Missing (%) | 99.6% |
| Memory size | 15.4 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 18.23008323 |
| Min length | 2 |
Characters and Unicode
| Total characters | 153315 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 28 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
| vehicle | 3859 | 18.5% |
| station | 3165 | 15.2% |
| utility | 3165 | 15.2% |
| sedan | 2989 | 14.3% |
| wagon/sport | 2363 | 11.3% |
| passenger | 1487 | 7.1% |
| (blank) | 804 | 3.9% |
| wagon | 804 | 3.9% |
| sport | 802 | 3.8% |
| truck | 233 | 1.1% |
| Other values (63) | 1164 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 12435 | 8.1% |
| S | 12204 | 8.0% |
| t | 11882 | 7.8% |
| i | 9751 | 6.4% |
| a | 7881 | 5.1% |
| e | 7836 | 5.1% |
| n | 7768 | 5.1% |
| o | 7235 | 4.7% |
| E | 6126 | 4.0% |
| l | 4789 | 3.1% |
| Other values (44) | 65408 | 42.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 76600 | 50.0% |
| Uppercase Letter | 60724 | 39.6% |
| Space Separator | 12435 | 8.1% |
| Other Punctuation | 3167 | 2.1% |
| Dash Punctuation | 182 | 0.1% |
| Decimal Number | 161 | 0.1% |
| Close Punctuation | 23 | < 0.1% |
| Open Punctuation | 23 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 12204 | 20.1% |
| E | 6126 | 10.1% |
| T | 4486 | 7.4% |
| I | 4008 | 6.6% |
| V | 3967 | 6.5% |
| N | 3429 | 5.6% |
| U | 3336 | 5.5% |
| W | 3264 | 5.4% |
| A | 3211 | 5.3% |
| O | 2625 | 4.3% |
| Other values (13) | 14068 | 23.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 11882 | 15.5% |
| i | 9751 | 12.7% |
| a | 7881 | 10.3% |
| e | 7836 | 10.2% |
| n | 7768 | 10.1% |
| o | 7235 | 9.4% |
| l | 4789 | 6.3% |
| d | 3134 | 4.1% |
| r | 2797 | 3.7% |
| c | 2783 | 3.6% |
| Other values (12) | 10744 | 14.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 133 | 82.6% |
| 2 | 14 | 8.7% |
| 6 | 13 | 8.1% |
| 3 | 1 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 12435 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3167 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 182 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 23 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 23 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 137324 | 89.6% |
| Common | 15991 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 12204 | 8.9% |
| t | 11882 | 8.7% |
| i | 9751 | 7.1% |
| a | 7881 | 5.7% |
| e | 7836 | 5.7% |
| n | 7768 | 5.7% |
| o | 7235 | 5.3% |
| E | 6126 | 4.5% |
| l | 4789 | 3.5% |
| T | 4486 | 3.3% |
| Other values (35) | 57366 | 41.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 12435 | 77.8% |
| / | 3167 | 19.8% |
| - | 182 | 1.1% |
| 4 | 133 | 0.8% |
| ) | 23 | 0.1% |
| ( | 23 | 0.1% |
| 2 | 14 | 0.1% |
| 6 | 13 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 153315 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 12435 | 8.1% |
| S | 12204 | 8.0% |
| t | 11882 | 7.8% |
| i | 9751 | 6.4% |
| a | 7881 | 5.1% |
| e | 7836 | 5.1% |
| n | 7768 | 5.1% |
| o | 7235 | 4.7% |
| E | 6126 | 4.0% |
| l | 4789 | 3.1% |
| Other values (44) | 65408 | 42.7% |